LANCE: Piercing to the Heart of Instance Matching Tools

نویسندگان

  • Tzanina Saveta
  • Evangelia Daskalaki
  • Giorgos Flouris
  • Irini Fundulaki
  • Melanie Herschel
  • Axel-Cyrille Ngonga Ngomo
چکیده

One of the main challenges in the Data Web is the identification of instances that refer to the same real-world entity. Choosing the right framework for this purpose remains tedious, as current instance matching benchmarks fail to provide end users and developers with the necessary insights pertaining to how current frameworks behave when dealing with real data. In this paper, we present Lance, a domain-independent instance matching benchmark generator which focuses on benchmarking instance matching systems for Linked Data. Lance is the first Linked Data benchmark generator to support complex semantics-aware test cases that take into account expressive OWL constructs, in addition to the standard test cases related to structure and value transformations. Lance supports the definition of matching tasks with varying degrees of difficulty and produces a weighted gold standard, which allows a more fine-grained analysis of the performance of instance matching tools. It can accept any linked dataset and its accompanying schema as input to produce a target dataset implementing test cases of varying levels of difficulty. We provide a comparative analysis with Lance benchmarks to assess and identify the capabilities of state of the art instance matching systems as well as an evaluation to demonstrate the scalability of Lance’s test case generator.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

LANCE: A Generic Benchmark Generator for Linked Data

Identifying duplicate instances in the Data Web is most commonly performed (semi-)automatically using instance matching frameworks. However, current instance matching benchmarks fail to provide end users and developers with the necessary insights pertaining to how current frameworks behave when dealing with real data. In this demo paper, we present Lance, a domain-independent instance matching ...

متن کامل

How Well Does Your Instance Matching System Perform? Experimental Evaluation with LANCE

Identifying duplicate instances in the Data Web is most commonly performed (semi-)automatically using instance matching frameworks. However, current instance matching benchmarks fail to provide end users and developers with the necessary insights pertaining to how current frameworks behave when dealing with real data. In this paper, we present the results of the evaluation of instance matching ...

متن کامل

Breastfeeding or oral sucrose solution in term neonates receiving heel lance: a randomized, controlled trial.

OBJECTIVE The purpose of this work was to compare the efficacy of breastfeeding versus orally administered sucrose solution in reducing pain response during blood sampling through heel lance. METHODS; We conducted an open-label, randomized, controlled trial at a neonatal unit of a public hospital in northern Italy on 101 term neonates undergoing heel lance with an automated piercing device for ...

متن کامل

The Effect of Warm Compression Applied before Heel Lance on Pain Level, Comfort Level and Procedure Time in Healthy Term Newborns: A Randomized Clinical Trial

Background & aim: Warm compression is an effective method preferred in relieving pain. It enables procedures to be completed in a shorter time, and with less pain due to increasing blood flow in the area. This study aimed to investigate the effects of warm compress applied before heel lance on the procedure time, level of pain, and comfort level of healthy term newborn...

متن کامل

Instance matching benchmark for spatial data: a challenge proposal to OAEI

The number of datasets published in the Web of Data as part of the Linked Data Cloud is constantly increasing. The Linked Data paradigm is based on the unconstrained publication of information by different publishers, and the interlinking of Web resources across knowledge bases. In most cases, the cross-dataset links are not explicit in the dataset and must be automatically determined using Ins...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015